.1 : • 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



June 6, 2004, 18:30:43 ; Search time 3117 Seconds 

(without alignments) 
15699.161 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-941-945A-1 

1129 

1 catatgcaggcgaactcctg 



ccagtggaatccacttcgga 1129 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



3470272 seqs, 21671516995 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



6940544 



Database 



GenEmbl : ^ 

1 : gb_ba : ^ 

2: gb_htg:^ 

3: gb_in:* 

4 : gb_om : * 

5 : gb_ov : * 

6 : gb_pat : * 

7 ; gb_ph : ^ 

8 : gb_^pl : ^ 

9 : gb_pr : * 

10: gb_ro:^ 

11: gb_sts : * 

12: gb_sy:* 

13: gb_un:^ 

14: gb_vi:* 

15: em_ba:^ 

16: em_fun:'^ 

1 7 : em__hum : * 

18: em_in:* 

1 9 : em_mu : ^ 

2 0 : em_om : * 

21 : em_or : * 

22 : em__ov: * 

23: em_pat:* 

2 4 : em ph : ^ 

25: em_pl:-^ 

2 6 : em_ro : ^ 

27: em sts:^ 



St 



t 



z o : 


em 


un : ^ 


on. 

z y : 


em 


VI : * 


oU : 


em 


-J- » 1» 

htg hum:* 


31 : 


em 


_htg_inv : * 


Jz : 


em 


_htg_other : * 


0 J : 


em 


htg mus : * 


J 4 : 


em 


htg pin : * 


0 o ; 


em 


htg rod : * 


36: 


em 


_htg mam: * 


37: 


em 


_htg vrt : * 


38: 


em 


sy : * 


39: 


em 


htgo hum: ^ 


40: 


em 


_htgo mus : * 


41: 


em 


htgo other:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

a 
o 



Result Query 







0 cor e 


Match 


Length 


DB 


ID 


Description 




1 

X 


-L r z y 


100 


.0 


Jz 0 UoU 


1 


IAP00527 5J vjcr\Q£Jj\J5;^ 


AP005275 


Corynebac 




0 

£^ 


X X z, _? 


100 


.0 


-3 ^ y y 0 u 


b 


AX12/144 IkP ii^ffJC 


AX127144 


Sequence 




■J) 


± ± 


100, 


.0 




b 


AXlz / 145 


7\X127145 


Sequence 




A 




50, 


.0 




b 


AaIzIU / 3 J 


AX120753 


Sequence 




5 


564 


50, 


.0 


564 




Rni f^9ft7n 


BD162870 


Novel pol. 




6 


492. 6 


43, 


.6 


308750 


1 


AP005216 


AP005216 


Corynebac 




7 


363 


32, 


.2 


1156 


6 


AX773541 


AX773541 


Sequence 




8 


263 


23. 


.3 


1026 


6 


AX120754 


AX120754 


Sequence 




9 


263 


23. 


.3 


1026 


6 


BD162871 


BD162871 


Novel pol 




10 


258 


22, 


.9 


341553 


1 


BX248355 


BX248355 


Corynebac 




11 


133. 4 


11. 


,8 


874 


6 


AR199611 


AR199611 


Sequence 




12 


133. 4 


11. 


.8 


874 


6 


BD069092 


BD069092 


Nucleic a 


c 


13 


133.4 


11. 


,8 


14240 


1 


AE007157 


AE007157 


Mycobacte 


c 


14 


133.4 


11. 


,8 


316050 


1 


BX248346 


BX248346 


Mycobacte 


c 


15 


133.4 


11. 


8 


349606 


15 


BX842583 


Bx842583 Mycobacte 




16 


84 


7 . 


4 


300425 


1 


AP005041 


AP005041 


Streptomy 




17 


83.2 


7. 


4 


292100 


1 


SC0939121 


AL939121 


Streptomy 




18 


80.2 


7. 


1 


4692 


1 


SCAJ10601 


AJ010601 


Streptomy 


c 


19 


62.4 


5. 


5 


42325 


1 


U00015 


U00015 Mycobacteri 


c 


20 


62. 4 


5. 


5 


42325 


6 


AR345363 


AR345363 


Sequence 




21 


62. 4 


5. 


5 


344050 


1 


MLEPRTN2 


AL583918 


Mycobacte 


c 


22 


49.2 


4 . 


4 


2000 


6 


AX655393 


AX655393 


Sequence 


c 


23 


43 


3. 


8 


125020 


9 


AF429315 


AF429315 


Homo sapi 




24 


42.8 


3. 


8 


125020 


9 


AF429315 


AF429315 


Homo sapi 




25 


42 


3. 


7 


193584 


2 


AC124172 


AC124172 


Mus muscu 


c 


26 


41,8 


3. 


7 


98466 


2 


AF551762 


AF551762 


Mus muscu 


c 


27 


41.8 


3. 


7 


165036 


10 


AC126433 


AC126432 


\ Mus muscu 


c 


28 


41.8 


3. 


7 


191713 


10 


AC117241 


AC117241 


, Mus muscu 




29 


41.8 


3. 


7 


198325 


2 


AY211543 


AY211543 


Mus muscu 




30 


40.6 


3. 


6 


2000 


6 


AX655393 


AX655393 


Sequence 


c 


31 


40 


3. 


5 


314450 


1 


BX294147 


BX294147 


Pirellula 




32 


39. 4 


3. 


5 


277000 


1 


SCO939109 


AL939109 


Streptomy 




33 


38.8 


3. 


4 


5610 


6 


AX278007 


AX278007 


Sequence 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on : 



Title: 

Perfect score 
Sequence : 

Scoring table 



June 6, 2004, 18:25:39 ; Search time 387 Seconds 

(without alignments) 
12393.322 Million cell updates/sec 

US-09-941-945A-1 
1129 

1 catatgcaggcgaactcctg ccagtggaatccacttcgga 1129 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 3373863 seqs, 2124099041 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



6747726 



Database : 



N_Geneseq_2 9Jan04 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3: geneseqn2000s : * 

4: geneseqn2001as : * 

5: geneseqn2001bs : ^ 

6 : geneseqn2002s : * 

7: geneseqn2003as : ^ 

8: geneseqn2003bs : ^ 

9 : geneseqn2003cs : 

10 : geneseqn2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



, 


Score 


Match 


Length 


DB 


ID 


1 


1129 


100.0 


349980 


5 


AAH68525 If 11(^^0 


2 


1129 


100. 0 


349980 


5 


AAH68526 


3 


1125.8 


99.7 


1129 


6 


ABA96000 "be /06Y353i 


4 


564 


50.0 


564 


5 


AAH65634 


5 


564 


50.0 


564 


7 


ACA00076 


6 


263 


23.3 


1026 


5 


AAH65635 


7 


263 


23.3 


1026 


7 


ACA00077 


8 


133.4 


11.8 


1105 


2 


AAV44589 




D^escription 



Aah68525 
Aah68526 
7\ba96000 
Aah65634 
Aca00076 
Aah65635 
Aca00077 
Aav44589 



C glutami 
C glutami 
Corynebac 
C glutami 
C, glutam 
C glutami 
C, glutam 
Mycobacte 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic 
Run on: 



nucleic search, using sw model 
June 6, 2004, 22:19:59 



Title: 

Perfect score: 
Sequence ; 



US-09-941-945A-1 
1129 

1 catatgcaggcgaactcctg 



Search time 80 Seconds 
(without alignments) 
7831.744 Million cell updates/sec 



ccagtggaatccacttcgga 1129 



1365418 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 682709 seqs, 277475446 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patent s_NA: * 

1 : /cgn2_6/ptodata/2/ina/5A_COMB. seq: ^ 

2 : /cgn2_6/ptodata/2/ina/5B_COMB. seq: * 

3 : /cgn2_6/ptodata/2/ina/6A_COMB. seq: ^ 

4 : /cgn2_6/ptodata/2/ina/6B_COMB. seq: * 

5 : /cgn2_6/ptodata/2/ina/PCTUS_COMB. seq: * 

6 : /cgn2_6/ptodata/2/ina/backf ilesl .seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 




1 


133. 


4 


11.8 


874 


4 


US-09-082-920-1 


Sequence 1, Appli 


c 


2 


133. 


4 


11.8 


4403765 


3 


US-09-103-840A-2 


Sequence 2, Appli 


c 


3 


133. 


4 


11.8 


4411529 


3 


US-09-103-840A-1 


Sequence 1, Appli 


c 


4 


62. 


4 


5.5 


42325 


4 


US-08-311-731A-131 


Sequence 131, App 




5 


38. 


8 


3.4 


5610 


4 


US-10-204-708-54 


Sequence 54, Appl 




6 


37. 


2 


3.3 


813 


4 


US-09-252-991A-5906 


Sequence 5906, Ap 


c 


7 


37. 


2 


3.3 


1605 


4 


US-0 9-252-991A-5966 


Sequence 5966, Ap 


c 


8 


37. 


2 


3.3 


1623 


4 


US-09-252-991A-5984 


Sequence 5984, Ap 


c 


9 


36. 


6 


3.2 


684 


4 


US-09-252-991A-2524 


Sequence 2524, Ap 




10 


36. 


2 


3.2 


966 


4 


US-09-252-991A-2236 


Sequence 2236, Ap 


c 


11 


35. 


6 


3.2 


2016 


4 


US-09-252-991A-2445 


Sequence 2445, Ap 




12 


35. 


4 


3.1 


1429 


4 


US-08-879-337-2 


Sequence 2, Appli 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: June 6, 2004, 23:47:24 ; Search time 401 Seconds 

(without alignments) 
12844.123 Million cell updates/sec 

Title: US-0 9-94 1-94 5A-1 

Perfect score: 1129 

Sequence: 1 catatgcaggcgaactcctg ccagtggaatccacttcgga 1129 



Scoring table: IDENTITY_NUC 

Gapop 10.0 f Gapext 1.0 

Searched: 2995936 seqs, 2280998010 residues 



Total number of hits satisfying chosen parameters: 5991872 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications__NA: 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB. seq: ^ 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW__PUB. seq: ^ 

3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: ^ 

4 : /cgn2__6/ptodata/2/pubpna/US06_PUBCOMB. seq: ^ 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB. seq: ^ 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: ^ 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB . seq: ^ 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq: * 
10: /cgn2__6/ptodata/2/pubpna/US09B_PUBCOMB.seq: * 
11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 
12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq: ^ 
13 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB. seq2 : ^ 
14 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB. seq: * 
15: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq: ^ 
16: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB. seq: * 
17 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB. seq: ^ 
18 : /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 
19: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB. seq: ^ 



Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 



SUMMARIES 



Query 





No . 


Score 


Match 


Length 


DB 


/ 




1 


112 9 


ICQ. 


.0 


112 9 


9 


US-09-941-945A-1 / 




2 


1129 


100, 


,0 


3309400 


9 


US-09-738-626-1-/ 




3 


564 


50, 


,0 


564 


9 


US-09-738-626-669 




/I 

4 


263 


23. 


.3 


1026 


9 


US-09-738-626-670 




5 


84 


7. 


.4 


9025608 


15 US-10-156-761-1 




6 


81.6 


7 . 


2 


573 


15 


US-10-156-7 61-4 983 




/ 


38 . 8 


3. 


4 


5610 


1 r" 

15 


US- 10-2 39- 67 6- 170 




o 

8 


38 . 8 


3. 


4 


5610 


15 


US-10-204-708-54 






38.8 


3. 


4 


5610 


15 


US-1 0-2 4 0-4 53-192 




10 


38 . 8 


3. 


4 


5610 


17 


US- 10-24 0-58 9C- 90 




11 


38.4 


3. 


4 


2320 


13 


US- 10- 02 7- 632 -2 64 352 




Iz 


38.4 


3. 


4 


2320 


16 


US-10-027-632-264352 


c 


13 


38.2 


3. 


4 


2655 


13 


US-10-2 82-122A-15 0 67 


c 


14 


37.8 


3. 


3 


2562 


13 


US-10-282-122A-31875 




15 


37 . 8 


3. 


3 


18011 


15 


US-10-311-455-8 




16 


36.6 


3. 


2 


1002 


16 


US- 10-3 6 9- 4 93-4 54 11 




1 / 


3 6.4 


3. 


2 


671 


15 


US-10-18 4-644-34 6 




18 


36.4 


3. 


2 


671 


15 


US-10-18 4-634 -34 6 


c 


1 y 


3 6.4 


3. 


2 


^ ^ 

737 


13 


US- 10-4 2 5-1 14 -27 636 


c 




35 . 6 


3. 


2 


453 


9 


US-09-938-84 2A-34 37 


c 


Z 1 


35.6 


3. 


2 


453 


11 


US-0 9-938-8 42A-34 37 


c 




35 . 6 


3. 


2 


1704 


13 


US-10-28 2-122A-30 6 61 




z3 


35 . 4 


3. 


1 


1429 


8 


US-08-879-337-1 




z 4 


35 . 2 


3. 


1 


144 9 


13 


US-10-4 24-5 99-137 657 




z5 


35 


3. 


1 


716 


13 


US-10-142-426-96 




z 6 


35 


3. 


1 


716 


15 


US-1 0-1 2 3- 15 5- 96 




z / 


35 


3. 


1 


716 


15 


US-10-146-731-96 




28 


35 


3. 


1 


716 


15 


US-10-140-472-96 




z y 


35 


3. 


1 


716 


15 


US-10-141-761-96 




30 


35 


3. 


1 


716 


15 


US-10-142-885-96 




J 1 


35 


3. 


1 


716 


15 


US-10-158-790-96 




32 


35 


3. 


1 


716 


16 


US-10-137-871-96 




33 


35 


3. 


1 


716 


16 


US-10-140-923-96 




Q /I 
o4 


35 


3. 


1 


716 


16 


US-10-141-756-96 




3o 


35 


3. 


1 


716 


16 


US-10-141-759-96 




36 


35 


3. 


1 


716 


16 


US-10-140-805-96 




37 


35 


3. 


1 


716 


16 


US-10-140-864-96 


c 


38 


35 


3. 


1 


4639 


15 


US-10-096-534-48 


c 


39 


34.8 


3. 


1 


745 


13 


US-10-424-599-142608 


c 


40 


34 . 8 


3. 


1 


771 


13 


US-10-194-163-667 




41 


34.8 


3. 


1 


1407 


13 


US-10-417-700A-76 




42 


34 . 6 


3. 


1 


723 


16 


US-10-369-493-42286 


c 


43 


34 . 6 


3. 


1 


1040 


13 


US-10-425-114-21645 




44 


34 . 6 


3. 


1 


1710 


16 


US-10-369-493-27553 


c 


45 


34 . 6 


3. 


1 


2199 


13 


US-10-282-122A-25416 



Description 



Sequence 1, Appli 
Sequence 1, Appli 

Sequence 669, App 

Sequence 670, App 
Sequence 1, Appli 
Sequence 4983, Ap 
Sequence 170, App 
Sequence 54, Appl 
Sequence 192, App 
Sequence 90, Appl 
Sequence 264352, 
Sequence 264352, 
Sequence 15067, A 
Sequence 31875, A 
Sequence 8, Appli 
Sequence 45411, A 
Sequence 34 6, App 
Sequence 34 6, App 
Sequence 27636, A 

Sequence 3437, Ap 
Sequence 3437, Ap 
Sequence 30661, A 

Sequence 1, Appli 
Sequence 137657, 
Sequence 96, Appl 
Sequence 96, Appl 
Sequence 96, Appl 
Sequence 96, Appl 
Sequence 96, Appl 
Sequence 96, Appl 
Sequence 96, Appl 
Sequence 96, 
Sequence 96, 
Sequence 96, 
Sequence 96, 
Sequence 96, Appl 
Sequence 96, Appl 
Sequence 48, Appl 
Sequence 142608, 
Sequence 667, App 
Sequence 76, Appl 
Sequence 42286, A 
Sequence 21645, A 
Sequence 27553, A 
Sequence 25416, A 



Appl 
Appl 
Appl 
Appl 



ALIGNMENTS 



RESULT 1 

-09-941-945A-1 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on : 



Title: 



June 6, 2004, 22:09:53 ; Search time 2427 Seconds 

(without alignments) 
13891.389 Million cell updates/s 



US-09-941-945A-1 
Perfect score: 1129 

Sequence : 1 catatgcaggcgaactcctg ccagtggaatccacttcgga 1129 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 27513289 seqs, 14931090276 residues 

Total number of hits satisfying chosen parameters: 55026578 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



EST: ^ 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 



em_estba : * 
em_esthum: * 
em_estin : * 
em_estmu : * 
em_estov : * 
em_estpl : ^ 
em^estro : ^ 
em_htc : * 
gb_estl : * 
gb_est2 : * 
gb_htc : * 
gb_est3 : ^ 
gb_est4 : * 
gb_est5 : * 
em_estf un : + 
em_estom: * 
em_gss_hum : * 
em_gss_inv : ^ 
em_gss_pln: ^ 
em_gss_vrt : * 
em_gss_f un : * 
em_gss_mam: ^ 
em_gss_mus : ^ 
em_gss_pro : * 
em_gss_rod: * 
em_gss_phg : 
em_gss_vrl : * 
gb gssl:* 



29: gb_gss2:^ 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 





1 




4 , o 


iZul 


13 


BX361080 


BX361080 BX361080 




9 


^ 0 . z 


4 . U 


1207 


2 9 


CNS015ZN 


AL106109 Drosophil 






A 1 


o . b 


o o c; 


13 


BX425 603 


BX425603 BX425603 






Q ft 


9 c; 


Iz Ul 


1 'D 

13 


BX364 960 


BX364960 BX364960 




o 


J y . z 


J . D 


IzOl 


13 


BX381961 


BX381961 BX381961 








9 


/ i Z 


1 J 


BX4 1 D / z / 


BX416727 BX416727 




1 


^ Q 


J . 0 


1 lUl 


z 9 


CNSOOLOO 


AL068607 Drosophil 


C 


Q 
O 


o y 


3 . 0 


1101 


2 9 


CNS00ZB7 


AL097453 Drosophil 




q 


oo . b 


o . 4 


1101 


29 


CNS00LT2 


AL078714 Drosophil 




1 n 

-L U 


J c5 . Z 


T /I 


one 

3yb 


13 


BY224 94 9 


BY224949 BY224949 




1 1 




J . 4 


Iz; Ul 


13 


BX35 6004 


BX356004 BX356004 






■^7 


O . J 


izUl 


13 


BX38 1 961 


BX381961 BX381961 




J. J 


o / , U 


J , O 


o n "7 o 

zlU /z 


1 1 


AK016917 


AK016917 Mus muscu 




1 4 


'^7 


9 9 


zz 4 o 


1 1 
1 1 


AKOz 9525 


AK029525 Mus muscu 




X J 


^7 4 


9 ^ 


Q ii: 
0 D 0 


zi y 


CG / / 1362 


CG771362 TCB47.2 F 




X u 


^7 9 
J / . Z 


o . J 


/I Q C 
4 J3 


1 A 

1 4 


W96254 


W96254 ze42b01.rl 


C 


1 7 
X / 


9 7 0 
J / . Z 


J . 3 


y1 / ^ 

4 4 4 


9 


AA01827 9 


AA018279 ze53h06.r 




1 ft 
X o 


"^7 9 
O / . Z 


O . -J 


4 bU 


y 


AA0182 67 


AA018267 ze53g07.r 




X y 


"^7 9 
o / . z 


9 9 


4 b4 


y 


AA054 016 


AA054016 zf48h07.r 


c 


z u 


O / , Z 


J . J 


4 93 


9 


AA01338 4 


AA013384 ze28bl2.r 




9 1 

Z X 


O / . Z 


3 . 3 


d /A J— 
o05 


14 


H8 6191 


H86191 ys94g06.rl 




9 9 
Z Z 


^ 7 O 
O / . Z 


3 . 3 


o r\ T 

807 


12 


BG321006 


BG321006 Zm04 03el 




9 ^ 
z J 


O / , Z 


3 . 3 


f~i ~j 
887 


12 


BI096866 


BI096866 SCUMtigll 




9 A 
Z 4 


o / . z 


3 . 3 


933 


28 


BZ554206 


BZ554206 pacsl-60 




9 R 

z o 


9 7 O 
J / . Z 


"5 'a 
o . J 


104 5 


13 


BX386287 


BX386287 BX386287 




9 

Z D 


9 7 0 
J / . Z! 


J . O 


lUb 6 


28 


BZ550840 


B2550840 pacsl-60 


c 


97 
z / 


9 7 


3 , 3 


■o o n 

38 9 


14 


H84094 


H84094 ys94fll.rl 




9ft 
z o 


97 


o . J 


4 00 


1 4 


H84088 


H84088 ys94el2.rl 


c 


29 


37 


3.3 


427 


9 


AA056119 


AA056119 zf55g07.r 


c 


30 


37 


3.3 


995 


13 


BX404378 


BX404378 BX404378 




31 


36.8 


3.3 


1101 


29 


CNSOOFCQ 


AL070525 Drosophil 


c 


32 


36.6 


3,2 


421 


28 


AQ234900 


AQ234900 HS 2054 A 


c 


33 


36.6 


3.2 


475 


10 


BF412036 


BF412036 UI-R-BTl- 




34 


36.6 


3.2 


531 


12 


BP135277 


BP135277 BP135277 


c 


35 


36.6 


3.2 


704 


29 


AG171291 


AG171291 Pan trogl 




36 


36.6 


3.2 


884 


29 


CNSO06U0 


AL065923 Drosophil 




37 


36. 6 


3.2 


947 


28 


BZ570083 


BZ570083 msh2 1157 




38 


36.4 


3.2 


289 


10 


AW372518 


AW372518 PM3-BT034 


c 


39 


36.4 


3.2 


541 


28 


AQ611746 


AQ611746 HS 5087 B 


c 


40 


36.4 


3.2 


565 


28 


B68684 


B68684 CIT-HSP-205 


c 


41 


36.2 


3.2 


442 


14 


CF485590 


CF485590 POLl 32 A 


c 


42 


36.2 


3.2 


552 


29 


CG335625 


CG335625 OG3BN04TV 




43 


36.2 


3.2 


559 


29 


CG335610 


CG335610 OG3BN04TH 


c 


44 


36.2 


3.2 


650 


29 


CC610253 


CC610253 OGUCS45TV 


c 


45 


36.2 


3.2 


706 


12 


BM338529 


BM338529 MEST228-C 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using frame plus p2n model 



Run on: 



June 7, 2004, 00:31:14 



Search time 2368 Seconds 
(without alignments) 
3441.085 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-941-945A-2 
929 

1 MADTERELADLVPQATAGDR 



VAQHRALTTLRSTLEQQENK 188 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 

0.5 
7.0 
7.0 



Searched: 



3470272 seqs, 21671516995 residues 



Total number of hits satisfying chosen parameters: 



6940544 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q-/cgn2_l/USPTO_spool/US0 994194 5/runat_03062004_164 702_52 97/app__query. fasta_l 
7 

-DB=GenEmbl -QFMT-fastap -SUFFIX=rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX-blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR__MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT^pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-QSER=US09941945_@CGN_l_l_2527_@runat_030 620 04_1 64 702^52 97 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP-10 -XGAPEXT=0 . 5 -FGAPOP= 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT^O . 5 -DEL0P=6 -DELEXT=7 

Database : GenEmbl : ^ . 



1: 
2: 
3: 
4 : 
5: 
6: 

7 : 

8 : 
9: 
10 



gb_ba : ^ 
gb_htg : * 
gb_in : ^ 
gb om:^ 
gb_ov : * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb ro:* 



11: 


gb sts:* 




12: 


gb sy:^ 




13: 


gb un : * 




14: 


gb vi : ^ 




15: 


em ba : * 




16: 


em fun : ^ 




17: 


em hum : ^ 




18: 


em in:* 




19: 


em mu : * 




20: 


em om : * 




21: 


em or:* 




22: 


em ov:* 




23: 


em pat : * 




24: 


em ph:* 




25: 


em_pl : * 




26: 


em ro : * 




27: 


em sts:* 




28 : 


em un:* 




29: 


em vi : * 




30: 


em htg hum: 


•k 


31: 


em htg inv: 


■k 


32: 


em_htg other:* 


33: 


em htg mus : 


kr 


34 : 


em htg pin: 




35: 


em htg rod: 




36: 


em htg mam: 




37 : 


em htg vrt : 




38: 


em sy : * 




39: 


em htgo hum 




40: 


em_htgo mus 


: * 


41: 


em htgo other:* 



ance to have 



Pred. No. is the number of results predicted b 

score greater than or equal to the score ^ the result being printed, 
and is derived by analysis of the t^^Si^core distribution. 



SUMMARIES 



Result 




Query 










No. 


Score 


Match, 


/^Tength 


DB 


ID 




1 


926 






) 564 


6 


AX120753 




2 


926 


99. 


7 


564 


6 


BD162870 




3 


926 


99. 


7 


328050 


1 


|aP005275]' 




4 


926 


99. 


7 


349980 


6 


AX127144 




5 


926 


99. 


7 


349980 


6 


AX127145 




6 


850 


91. 


5 


308750 


1 


AP005216 




7 


657 


70. 


7 


341553 


1 


BX248355 




8 


473 


50. 


9 


874 


6 


AR199611 




9 


473 


50. 


9 


874 


6 


BD069092 


c 


10 


473 


50. 


9 


14240 


1 


AE007157 


c 


11 


473 


50. 


9 


316050 


1 


BX248346 


c 


12 


473 


50. 


9 


349606 


15 


BX842583 




13 


397.5 


42. 


8 


300425 


1 


AP005041 




14 


388 


41. 


8 


292100 


1 


SC0939121 




15 


385 


41. 


4 


4692 


1 


SCAJ10601 


c 


16 


320 


34 . 


4 


42325 


1 


U00015 



Description 



€P 
BP 



AX120753 Sequence 
BD162870 Novel pol 
AP005275 Corynebac 
7VX127144 Sequence 
AX127145 Sequence 
AP005216 Corynebac 
BX24 8355 Corynebac 
AR199611 Sequence 
BD069092 Nucleic a 
AE007157 Mycobacte 
BX248346 Mycobacte 

Bx842583 Mycobacte 
AP005041 Streptomy 
AL939121 Streptomy 
AJ010601 Streptomy 
U00015 Mycobacteri 



GenCore version 5.1,6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



June 7, 2004, 00:29:29 



Search time 301 Seconds 
(without alignments) 
2653.360 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09~941-945A-2 

929 

1 MADTERELADLVPQATAGDR 



VAQHRALTTLRSTLEQQENK 188 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , 

Ygapop 10.0 , 

Fgapop 6.0 , 

Delop 6.0 , 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



3373863 seqs, 



2124099041 residues 



Total number of hits satisfying chosen parameters: 



6747726 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US0994194 5/runat_03062004_1647 01_5287/app_query, fasta_l 
7 

-DB=N_Geneseq_29Jan04 -QFMT=fastap -SUFFIX=rng -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT-0 -UNITS=bits -START-1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST-45 -DOCALIGN=200 -THR__SCORE-pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -MORM=ext -HEAPSIZ.E=500 -MINLEN-0 -MAXLEN-2000000000 
-USER=US0994194 5_@CGN_l_l_352_@runat__03062004_164 701_5287 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -TSiEG_SCORES=0 -WAIT -DSPBLOCK-10 0 -LONGLOG 
-DEV_TIMEOUT-120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP= 
-FGAPEXT-7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP-6 -DELEXT=7 

Database : N^Geneseq 29Jan04:'^ 



1: 

2: 
3: 
4 : 
5: 
6: 
7: 
8: 
9: 
10 



geneseqnl980s : * 
geneseqnl990s : ^ 
geneseqn2000s : * 
geneseqn2001as : * 
geneseqn2001bs : ^ 
geneseqn2002s : * 
geneseqn2003as : * 
geneseqn2003bs : ^ 
geneseqn2003cs : * 
geneseqn2004s : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

Q, 
O 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 




Description 


1 


926 


99. 


.7 


564 


5 


AAH65634 




Aah65634 


C glutami 


2 


926 


99. 


,7 


564 


7 


ACA0007 6 


DC ioiz,9Sto 


Aca00076 


C. glutam 


3 


926 


99. 


7 


349980 


5 


AAH68525 




Aah68525 


C glutami 


4 


926 


99, 


7 


349980 


5 


AAH68526 




Aah68526 


C glutami 


5 


920 


99. 


0 


1129 


6 


ABA96000 /OfiVSiP/ 


Aba96000 


Corynebac 


6 


473 


50. 


9 


1105 


2 


AAV44589 




Aav44589 


Mycobacte 


7 


473 


50. 


9 


110000 


4 


AAI99682 


38 


Continuation (39 o 


8 


473 


50. 


9 


110000 


4 


AAI99683~ 


_38 


Continuation (39 o 


9 


320 


34. 


4 


42325 


9 


ADB74382 




Adb7 4 382 


Mycobacte 


10 


153 . 5 


16. 


5 


564 


6 


ABK7 4 0 63 




Abk74063 


Bacillus 


11 


145.5 


15. 


7 


639 


5 


AAH66206 




Aah66206 


C glutami 


12 


145.5 


15. 


7 


648 


7 


ACA00415 




Aca00415 


C. glutam 


13 


145 . 5 


15. 


7 


1330 


6 


AAD35114 




Aad35114 


Corynebac 


14 


145.5 


15. 


7 


2086 


6 


AAD35115 




Aad35115 


Corynebac 


15 


145.5 


15. 


7 


349980 


5 


AAH68527 




Aah68527 


C glutami 


16 


135 . 5 


14 . 


6 


577 


6 


ABK74074 




Abk74074 


Bacillus 


17 


129 


13. 


9 


585 


7 


ABZ42002 




Abz42002 


N. gonorr 


18 


127 


13. 


7 


110000 


4 


AAI99682 


08 


Continuation (9 of 


19 


127 


13. 


7 


110000 


4 


AAI99683 


'08 


Continuation (9 of 


20 


123 


13. 


2 


37096 


3 


AAA81489 


^8 


Continuation (9 of 


21 


123 


13. 


2 


92934 


3 


AAA81473 




Aaa81473 


N. mening 


22 


123 


13. 


2 


172325 


3 


AAF21613 




Aaf21613 


Neisseria 


23 


122 


13. 


1 


13336 


4 


AAS59554 




Aas59554 


Propionib 


24 


122 


13. 


1 


13336 


7 


ACF64483 




Acf 64483 


Propionib 


25 


121 


13. 


0 


14615 


4 


AAS59577 




Aas59577 


Propionib 


26 


121 


13. 


0 


14615 


7 


ACF64506 




Acf64506 


Propionib 


27 


121 


13. 


0 


37856 


3 


AAA11992 




Aaall992 


S. cellul 


28 


119.5 


12. 


9 


38675 


9 


ADB74386 




Adb7 4 38 6 


Mycobacte 


29 


116.5 


12. 


5 


647 


2 


AAQ70249 




Aaq70249 


P. aerugi 


30 


116.5 


12. 


5 


647 


2 


AAQ70242 




Aaq70242 


P. aerugi 


31 


116.5 


12. 


5 


647 


3 


AAA51919 




Aaa51919 


P. aerugi 


32 


116.5 


12. 


5 


771 


6 


ABT05614 




Abt05614 


Mycobacte 


33 


116.5 


12. 


5 


771 


7 


ABZ71111 




Abz71111 


Mycobacte 


34 


116.5 


12. 


5 


75216 


6 


ABX09141 




Abx09141 


Mycobacte 


35 


116.5 


12. 


5 


110000 


4 


AAI99682 


13 


Continuation (14 o 


36 


116. 5 


12. 


5 


110000 


4 


AAI99683 


^13 


Continuation (14 o 


37 


116 


12. 


5 


603 


3 


AAA64868 




Aaa64868 


Bordetell 


38 


116 


12. 


5 


35026 


3 


AAA64890 




Aaa64890 


Bordetell 


39 


111 


11. 


9 


717 


8 


ADA30919 




Ada30919 


DNA encod 


40 


111 


11. 


9 


110000 


2 


AAT42063_ 


06^. 


Continuation (7 of 


41 


109 


11. 


7 


135638 


7 


ABX34289 


Abx34289 


S. atrool 


42 


108.5 


11. 


7 


582 


7 


AAD47841 




Aad47841 


Haemophil 


43 


108.5 


11. 


7 


603 


5 


AAH68381 




Aah68381 


C glutami 


44 


108.5 


11. 


7 


603 


7 


ACA012 97 




Aca01297 


C. glutam 


45 


108.5 


11. 


7 


1211 


6 


AAL42355 




Aal42355 


Corynebac 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f raine_plus__p2n model 

Run on: June 7, 2004, 01:25:39 ; Search time 66 Seconds 

(without alignments) 
1580.769 Million cell updates/sec 

Title: US-09-94 1-945A-2 

Perfect score: 929 

Sequence: 1 MADTERELADLVPQATAGDR VAQHRALTTLRSTLEQQENK 188 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 682709 seqs, 277475446 residues 

Total number of hits satisfying chosen parameters: 1365418 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US09941945/runat_03062004_164703_5323/app_query. fasta_l . 32 
7 

-DB=Issued__Patents_NA -QFMT=fastap -SUFFIX-rni -M1NMATCH=0 . 1 -LOOPCL-0 
-LOOPEXT=0 -UNITS^bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS-human4 0 . cdi 
-LIST-45 -DOCALIGN=200 -THR_SCORE-pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT-pto -NORM-ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN-^2 00 0000000 
-USER-US0 994194 5_@CGN_l_l_54_@runat_03062004_164 7 03__5 323 -NCPU=6 -ICPU=3 
-NO__MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK^lOO -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS-1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAP0P=6 
-FGAPEXT-7 -YGAPOP=10 -YGAPEXT=0.5 -DEL0P=6 -DELEXT=7 



Database : Issued_Patents_NA: * 

1 : /cgn2_6/ptodata/2/ina/5A_COMB. seq: 

2 : /cgn2_6/ptodata/2/ina/5B_COMB. seq: * 

3: /cgn2_6/ptodata/2/ina/6A_COMB. seq: * 

4 : /cgn2__6/ptodata/2/ina/6B_COMB. seq: * 

5 : /cgn2_6/ptodata/2/ina/PCTUS_COMB . seq: ^ 

6 : /cgn2_6/ptodata/2/ina/backf ilesl . seq: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


473 


50, 


. 9 


874 


4 


US-09-082-920-1 


Sequence 1, Appli 


c 


2 


473 


50. 


. 9 


4403765 


3 


US-09-103-840A-2 


Sequence 2, Appli 


c 


3 


473 


50. 


. 9 


4411529 


3 


US-09-103-840A-1 


Sequence 1, Appli 


c 


4 


320 


34 . 


. 4 


42325 


4 


US-08-311-731A-131 


Sequence 131, App 




5 


139 


15. 


.0 


621 


4 


US-09-252-991A-1369 


Sequence 1369, Ap 




6 


127 


13. 


, 7 


4403765 


3 


US-09-103-840A-2 


Sequence 2, Appli 




7 


127 


13, 


, 7 


4411529 


3 


US-09-103-840A-1 


Sequence 1, Appli 




8 


121 


13. 


, 0 


597 


4 


US-0 9-252- 991A-13522 


Sequence 13522, A 




9 


119.5 


12. 


, 9 


38675 


4 


US-08-311-731A-135 


Sequence 135, App 




10 


116.5 


12, 


5 


647 


1 


US-08-260-202A-9 


Sequence 9, Appli 




11 


116.5 


12. 


5 


647 


1 


US-08-017-114-9 


Sequence 9, Appli 




12 


116. 5 


12. 


5 


647 


3 


US-08-505-307-9 


Sequence 9, Appli 




13 


116 . 5 


12. 


5 


647 


4 


US-09-609-151A-9 


Sequence 9, Appli 




14 


116.5 


12. 


5 


647 


5 


PCT-US94-02034-9 


Sequence 9, Appli 




15 


116.5 


12. 


5 


840 


4 


US-09-252-991A-13069 


Sequence 13069, A 




16 


113 


12. 


2 


600 


4 


US-09-543-681A-3510 


Sequence 3510, Ap 




17 


111 


11. 


9 


717 


4 


US-09-328-352-2206 


Sequence 2206, Ap 




18 


111 


11. 


9 


1830121 


4 


US-09-557-884-1 


Sequence 1, Appli 




19 


111 


11. 


9 


1830121 


4 


US-09-643-990A-1 


Sequence 1, Appli 


c 


20 


108 . 5 


11. 


7 


1830121 


4 


US-09-557-884-1 


Sequence 1, Appli 


c 


21 


108 . 5 


11. 


7 


1830121 


4 


US-09-643-990A-1 


Sequence 1, Appli 


c 


22 


104 


11. 


2 


1746 


4 


US-09-252-991A-8985 


Sequence 8985, Ap 




23 


103 . 5 


11. 


1 


4605 


4 


US-09-221-017B-128 


Sequence 12 8, App 




24 


99 . 5 


10. 


7 


549 


4 


US-09-252-991A-11688 


Sequence 11688, A 




25 


99 


10. 


7 


846 


4 


US-09-252-991A-9635 


Sequence 9635, Ap 


c 


26 


99 


10. 


7 


1062 


4 


US-09-252-991A-9532 


Sequence 9532, Ap 




27 


99 


10. 


7 


1371 


4 


US-09-252-991A-9717 


Sequence 9717, Ap 




28 


97 . 5 


10. 


5 


540 


4 


US-09-252-991A-15549 


Sequence 15549, A 




29 


96.5 


10. 


4 


1230025 


4 


US-09-198-452A-1 


Sequence 1, Appli 




30 


96 


10. 


3 


525 


4 


US-09-107-532A-1956 


Sequence 1956, Ap 




31 


95 


10. 


2 


1173 


4 


US-0 9-107-532A-35 67 


Sequence 35 67, Ap 




32 


94 


10. 


1 


2000 


1 


US-08-622-353-1 


Sequence 1, Appli 




33 


94 


10. 


1 


2000 


1 


US-08-622-352A-1 


Sequence 1, Appli 




34 


94 


10. 


1 


2000 


3 


US-08-826-390-1 


Sequence 1, Appli 




35 


93 . 5 


10. 


1 


501 


4 


US-09-252-991A-9107 


Sequence 9107, Ap 








9. 


9 


615 


4 


US-0 9-48 9-03 9A-4 55 9 


Sequence 4559, Ap 




37 


88.5 


9. 


5 


804 


4 


US-09-489-039A-3407 


Sequence 34 07, Ap 


c 


38 


88.5 


9. 


5 


1092 


4 


US-09-252-991A-9447 


Sequence 94 4 7, Ap 




39 


88.5 


9. 


5 


1149 


4 


US-09-252-991A-3630 


Sequence 3630, Ap 




40 


88.5 


9. 


5 


1287 


4 


US-09-252-991A-9384 


Sequence 9384, Ap 


c 


41 


88.5 


9, 


5 


1785 


4 


US-0 9-252-9 91A-9417 


Sequence 9417, Ap 




42 


88.5 


9. 


5 


1842 


4 


US-09-252-991A-9328 


Sequence 9328, Ap 




43 


88 


9. 


5 


80161 


3 


US-09-036-987A-1 


Sequence 1, Appli 




44 


88 


9. 


5 


80161 


3 


US-09-370-700-1 


Sequence 1, Appli 




45 


88 


9. 


5 


80161 


4 


US-09-603-207-1 


Sequence 1, Appli 



ALIGNMENTS 

RESULT 1 

-09-082-920-1 

Sequence 1, Application US/09082920C 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rarae_plus_p2n model 



Run on: 



June 7, 2004, 02:26:35 



Search time 300 Seconds 
(without alignments) 
2858.851 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-941-945A-2 
929 

1 MADTERELADLVPQATAGDR 



VAQHRALTTLRSTLEQQENK 188 



Scoring table: 



BLOSUM62 



Xgapop 10.0 
Ygapop 10.0 
Fgapop 6 . 0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched : 



2995936 seqs, 2280998010 residues 



Total number of hits satisfying chosen parameters: 



5991872 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US09941945/runat_03062004_164704_5412/app query. fasta 1 . 32 
7 ~ ~ 

-DB=Published_Applications_NA -QFMT-fastap -SUFFIX=rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END— 1 -MATRIX-blosum62 
-TRANS=human4 0.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX-100 
-THR_MIN-0 -ALIGN=15 -MODE^LOCAL -OUTFMT-pto -NORM-ext -HEAPSIZE=500 -MINLEN-0 
-MAXLEN=2000000000 -USER=US0994 194 5_@CGN_l_l_354_@runat_03062004_1647 04__54 12 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK-100 
-LONGLOG -DEV__TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP-10 -XGAPEXT=0 . 5 
-FGAP0P=6 -FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP-6 -DELEXT=7 

Database : Published Applications NA: 



1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB. seq: ^ 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq: * 

3 : /cgn2_6/ptodata/2/pubpna/US06__NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 

6 : /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB. seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 

9 : /cgn2_6/ptodata/2/pubpna/US09A__PUBCOMB.seq: ^ 
10 : /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB. seq: * 
11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 



12 
13 
14 
15 
16 
17 
18 
19 



/cgn2_6/ptodata/2/pubpna/US09_NEW_PUB. seq: ^ 
/cgn2_6/ptodata/2/pubpna/US09_NEW_PUB . seq2 : ^ 
/cgn2__6/ptodata/2/pubpna/US10A_PUBCOMB. seq: * 
/cgn2__6/ptodata/2/pubpna/US10B_PUBCOMB. seq: ^ 
/cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq: ^ 
/cgn2_6/ptodata/2/pubpna/US10_NEW_PUB. seq: * 
/cgn2_6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 
/cgn2 6/ptodata/2/pubpna/US60 PUBCOMB . seq: ^ 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 
No. 



Score 



Query 

Match Length DB 



SUMMARIES 



ID 



Description 





1 


926 


99. 


7 


564 


9 


US-09-738-626-669 


Sequence 669, App 




2 


926 


99. 


7 


1129 


9 


US-09-941-945A-1 


Sequence 1, Appli 




3 


926 


99. 


7 


3309400 


9 


US-09-738-626-1 


Sequence 1, Appli 




4 


397.5 


42. 


8 


573 


15 


US-10-156-761-4983 


Sequence 4983, Ap 




5 


397.5 


42. 


8 


9025608 


15 


US-10-156-761-1 


Sequence 1, Appl 




6 


210.5 


22. 


7 


774 


15 


US-10-156-761-4719 


Sequence 4719, Ap 


c 


7 


210.5 


22. 


7 


9025608 


15 


US-10-156-761-1 


Sequence 1, Appl 




8 


210 


22. 


6 


711 


15 


US-10-156-761-3372 


Sequence 3372, Ap 




9 


194.5 


20. 


9 


687 


15 


US-10-156-761-3374 


Sequence 3374, Ap 




10 


153.5 


16. 


5 


564 


9 


US-09-974-300-1354 


Sequence 1354, Ap 




11 


145.5 


15. 


7 


537 


15 


US-10-156-761-4690 


Sequence 4690, Ap 




12 


145.5 


15. 


7 


639 


9 


US-09-738-626-1241 


Sequence 1241, Ap 




13 


145.5 


15. 


7 


1330 


9 


US-09-935-757-1 


Sequence 1, Appli 




14 


145.5 


15. 


7 


2086 


9 


US-09-935-757-5 


Sequence 5, Appli 




15 


145 


15. 


6 


612 


15 


US-10-156-761-886 


Sequence 8 8 6, App 




16 


142 


15. 


3 


651 


15 


US-10-156-761-606 


Sequence 60 6, App 




17 


141 


15. 


2 


618 


15 


US-10-156-761-6054 


Sequence 6054, Ap 




18 


135.5 


14 . 


6 


577 


9 


US-09-974-300-1365 


Sequence 1365, Ap 




19 


135 


14. 


5 


615 


15 


US-10-156-761-521 


Sequence 521, App 




20 


121.5 


13. 


1 


657 


15 


US-10-156-761-3104 


Sequence 3104, Ap 




21 


118 


12. 


7 


537 


15 


US-10-156-761-5382 


Sequence 5382, Ap 




22 


118 


12. 


7 


618 


15 


US-10-156-761-6317 


Sequence 6317, Ap 




23 


117.5 


12. 


6 


537 


15 


US-10-156-761-4048 


Sequence 4048, Ap 




24 


116.5 


12. 


5 


75216 


15 


US-10-080-170-646 


Sequence 64 6, App 




25 


116 


12. 


5 


546 


15 


US-10-156-761-6559 


Sequence 6559, Ap 




26 


113 


12, 


2 


594 


15 


US-10-156-761-5107 


Sequence 5107, Ap 




27 


112.5 


12. 


1 


645 


15 


US-10-156-761-691 


Sequence 691, App 




28 


111 


11. 


9 


1830121 


15 


US-10-329-960-1 


Sequence 1, Appl 




29 


111 


11. 


9 


1830121 


16 


US-10-329-670-1 


Sequence 1, Appl 




30 


109 


11. 


7 


135638 


15 


US-10-314-657-1 


Sequence 1, Appli 




31 


108.5 


11. 


7 


603 


9 


US-09-738-626-3416 


Sequence 3416, Ap 




32 


108.5 


11. 


7 


1211 


9 


US-09-942-935-1 


Sequence 1, Appli 


c 


33 


108.5 


11. 


7 


1830121 


15 


US-10-329-960-1 


Sequence 1, Appl 


c 


34 


108.5 


11. 


7 


1830121 


16 


US-10-329-670-1 


Sequence 1, Appl 




35 


107.5 


11. 


6 


579 


9 


US-09-738-626-284 


Sequence 28 4, App 




36 


107.5 


11. 


6 


1002 


15 


US-10-156-761-3350 


Sequence 3350, Ap 




37 


107.5 


11. 


6 


1109 


9 


US-09-941-936A-1 


Sequence 1, Appli 


c 


38 


107.5 


11. 


6 


3309400 


9 


US-09-738-626-1 


Sequence 1, Appli 




39 


107 


11. 


5 


591 


15 


US-10-156-761-6314 


Sequence 6314, Ap 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f raine_plus__p2n model 



Run on : 



June 7, 2004, 01:22:34 



Search time 2348 Seconds 
(without alignments) 
2391.009 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-941-945A-2 
929 

1 MADTERELADLVPQATAGDR 



VAQHRALTTLRSTLEQQENK 18 8 



Scoring table: 



BLOSUM62 



Xgapop 10.0 
Ygapop 10.0 
Fgapop 6.0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



27513289 seqs, 14931090276 residues 



Total number of hits satisfying chosen parameters: 



55026578 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q-/cgn2_l/USPTO_spool/US0994194 5/runat_03062004_164702_5307/app_query.fasta 1 
7 

-DB-EST -QFMT=fastap -SUFFIX-rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS-bits -START-1 -END=-1 -MATRIX=blosum62 -TRAISfS=human4 0 . cdi -LIST-45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN-0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT^pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN-2 000 000000 
-USER=US09941945_@CGN_l_l_2607_@runat_03062004_164702_5307 -NCPU-6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES-0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=12 0 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT-0 . 5 -FGAPOP- 
-FGAPEXT=7 -YGAPOP-10 -YGAPEXT-0 . 5 -DEL0P=6 -DELEXT=7 

Database : EST:-*^ 



1 : em_estba : * 

2: em_esthum:^ 

3: em_estin:* 

4: em^estmu:^ 

5: em_estov:* 

5: em_estpl:* 

7: em_estro:"^ 

8 : em htc : ^ 



9: gb_estl:^ 
10: gb_est2:* 
11: gb^htc:^ 



12 


: gb 


_est3: ^ 




13 


: gb 


est4 : * 




14 


: gb 


est5 : ^ 




15 


: em 


estf un : ^ 




16 


: em 


estom: ^ 




17 


: em 


gss hum: 


+ 


18 


: em 


_gss_inv: 




19 


: em 


gss pin: 




20 


: em 


gss vrt : 




21 


em 


gss fun: 




22, 


em 


gss mam: 




23: 


em 


gss mus : 


•k 


24 : 


em 


gss pro: 


kr 


25: 


em 


gss rod: 


ik 


26: 


em 


gss phg: 


+ 


27: 


em 


gss vrl : 


kr 


28: 


gb" 


gssl : ^ 




29: 


gb 


gss2 : ^ 





Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 





1 


120 


12. 


9 


879 


11 


BC022210 


BC022210 


Homo sapi 


c 


2 


110 


11. 


8 


1012 


28 


BZ573065 


BZ573065 


msh2 2928 




3 


109.5 


11. 


8 


763 


13 


BQ968973 


BQ968973 


QHB35M11. 




4 


108.5 


11. 


7 


698 


13 


BQ916398 


BQ916398 


QHB17M04 . 




5 


108.5 


11. 


7 


728 


13 


BQ916763 


BQ916763 


QHB19B03. 




6 


108.5 


11. 


7 


762 


13 


BQ969036 


BQ969036 


QHB35P11. 




7 


107.5 


11. 


6 


701 


13 


BQ914474 


BQ914474 


QHB10K23. 




8 


107 . 5 


11. 


6 


762 


13 


BU027838 


BU027838 


QHG8B12.y 




9 


107. 5 


11. 


6 


769 


13 


BQ916644 


BQ916644 


QHB18I24 . 




10 


105.5 


11. 


4 


739 


13 


BQ916439 


BQ916439 


QHB17N22. 




11 


103.5 


11. 


1 


717 


13 


BQ970657 


BQ970657 


QHB42L15. 




12 


102.5 


11. 


0 


1115 


28 


BZ553599 


BZ553599 


pacsl-60 




13 


101.5 


10. 


9 


743 


13 


BQ915857 


BQ915857 


QHB15P24 . 




14 


101.5 


10. 


9 


893 


13 


BQ931778 


BQ931778 


AGENCOURT 




15 


100.5 


10. 


8 


758 


13 


BQ917182 


BQ917182 


QHB20I24 . 


c 


16 


100.5 


10. 


8 


992 


28 


BZ552520 


BZ552520 


pacsl-60 




17 


99 


10. 


7 


498 


13 


BQ272709 


BQ272709 


sao20d037 


c 


18 


95.5 


10. 


3 


761 


28 


BZ558190 


BZ558190 


pacsl-60 




19 


94.5 


10. 


2 


816 


28 


BZ559495 


BZ559495 


pacs2-164 




20 


94 


10. 


1 


634 


9 


AV934201 


AV934201 AV934201 




21 


93 


10. 


0 


648 


12 


BJ471207 


BJ471207 


BJ471207 


c 


22 


92.5 


10. 


0 


483 


14 


CA548840 


CA54 88 4 0 


C0810G09- 




23 


92.5 


10. 


0 


753 


13 


BU023549 


BU023549 


QHF11L03. 




24 


91.5 


9. 


8 


229 


28 


AZ578624 


AZ578624 


25f08 Sho 




25 


91.5 


9. 


8 


767 


10 


BF862931 


BF862931 


963039F08 




26 


90.5 


9. 


7 


652 


13 


BQ915718 


BQ915718 


QHB15I21. 




27 


90 


9. 


7 


1001 


29 


CG348903 


CG348903 


OGWGL80TV 




28 


89 


9. 


6 


2636 


11 


AK002623 


AK002623 


Mus muscu 


c 


29 


88.5 


9. 


5 


501 


14 


CB972586 


CB972586 


CAB30001 



